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RAW SEQUENCE LISTING 
PATENT APPLICATION US/09/207, 161 




3/22/2 
TIME: 11:50:12 



INPUT SET: S3 51 14. m 



This Raw Listing contains the General 
Information Section and up to the first 5 pages. 



i 

2 
3 
4 
5 
6 
7 
8 
9 
10 
11 
12 
13 
14 
15 
16 
17 
18 
19 
20 
21 
22 
23 
24 
25 
26 
27 
28 
29 
30 
31 
32 
33 
34 
35 
36 
37 
38 
39 
40 
41 
42 
43 
44 
45 
46 



SEQUENCE LISTING 



(1) 



General Information: 



(i) APPLICANT: Hillman, Jennifer L . 

Goli, Surya K . 



ENTERED 



(ii) TITLE OF INVENTION: NOVEL HUMAN INTEGRAL MEMBRANE 

PROTEIN 

(iii) NUMBER OF SEQUENCES : 4 

(iv) CORRESPONDENCE ADDRESS: 



(A 
(B 
(C 

(D 
(E 
(F 



(v) COMPUTER READABLE FORM: 



(A 
(B 
(C 
(D 

(vi) 
(A 
(B 
(C 

( vii 
(A 
(B 

( vii 
(A 
(B 
(C 

(ix) 
(A 
(B 



ADDRESSEE: Incyte Pharmaceuticals, Inc. 

STREET: 3174 Porter Drive 

CITY: Palo Alto 

STATE: CA 

COUNTRY: USA 

ZIP: 94304 



MEDIUM TYPE: Diskette 
COMPUTER: IBM Compatible 
OPERATING SYSTEM: DOS 

SOFTWARE: FastSEQ for Windows Version 2.0 

CURRENT APPLICATION DATA: 
APPLICATION NUMBER: 09/207,161 
FILING DATE: . s 
CLASSIFICATION: 

PRIOR APPLICATION DATA: 
APPLICATION NUMBER: US/08/791,338 
FILING DATE: 

) ATTORNEY/AGENT INFORMATION: 
NAME: Billings, Lucy J. 
REGISTRATION NUMBER: 36,74 9 
REFERENCE/DOCKET NUMBER: PF-0208 US 

TELECOMMUNICATION INFORMATION: 
TELEPHONE: 415-855-0555 
TELEFAX: 415-845-416 6 



(2) INFORMATION FOR SEQ ID NO : 1 : 



PAGE: 2 RAW SEQUENCE LISTING DATE: 03/22/2000 

PATENT APPLICATION US/09/207,161 TIME: 1 1 :50: 12 

INPUT SET: S35114.mw 

47 

48 (i) SEQUENCE CHARACTERISTICS: 

49 (A) LENGTH: 266 amino acids 

50 (B) TYPE: amino acid 

51 (C) STRANDEDNESS : single 

52 (D) TOPOLOGY: linear 
53 

54 (vii) IMMEDIATE SOURCE: 

55 (A) LIBRARY: NEUTGMT0 1 

56 (B) CLONE: 632664 
57 

58 (Xi) SEQUENCE DESCRIPTION: SEQ ID NO:l: 

59 

60 Met Val Lys Val Thr Phe Asn Ser Ala Leu Ala Gin Lys Glu Ala Lys 

61 1 5 10 15 

62 Lys Asp Glu Pro Lys Ser Gly Glu Glu Ala Leu lie lie Pro Pro Asp 

63 20 25 30 

64 Ala Val Ala Val Asp Cys Lys Asp Pro Asp Asp Val Val Pro Val Gly 

65 35 40 45 

66 Gin Arg Arg Ala Trp Cys Trp Cys Met Cys Phe Gly Leu Ala Phe Met 

67 50 55 60 

68 Leu Ala Gly Val lie Leu Gly Gly Ala Tyr Leu Tyr Lys Tyr Phe Ala 

69 65 70 75 80 

70 Leu Gin Pro Asp Asp Val Tyr Tyr Cys Gly lie Lys Tyr lie Lys Asp 

71 85 90 95 

72 Asp Val lie Leu Asn Glu Pro Ser Ala Asp Ala Pro Ala Ala Leu Tyr 

73 100 105 110 

74 Gin Thr lie Glu Glu Asn lie Lys lie Phe Glu Glu Glu Glu Val Glu 

75 115 120 125 

76 Phe lie Ser Val Pro Val Pro Glu Phe Ala Asp Ser Asp Pro Ala Asn 

77 130 135 140 

78 lie Val His Asp Phe Asn Lys Lys Leu Thr Ala Tyr Leu Asp Leu Asn 

79 145 150 155 160 

80 Leu Asp Lys Cys Tyr Val lie Pro Leu Asn Thr Ser lie Val Met Pro 

81 165 170 175 

82 Pro Arg Asn Leu Leu Glu Leu Leu lie Asn lie Lys Ala Gly Thr Tyr 

83 180 185 190 

84 Leu Pro Gin Ser Tyr Leu lie His Glu His Met Val lie Thr Asp Arg 

85 195 200 205 

86 lie Glu Asn lie Asp His Leu Gly Phe Phe lie Tyr Arg Leu Cys His 

87 210 215 220 

88 Asp Lys Glu Thr Tyr Lys Leu Gin Arg Arg Glu Thr lie Lys Gly lie 

89 225 230 235 240 

90 Gin Lys Arg Glu Ala Ser Asn Cys Phe Ala lie Arg His Phe Glu Asn 

91 245 250 255 

92 Lys Phe Ala Val Glu Thr Leu lie Cys Ser 

93 260 265 
94 

95 (2) INFORMATION FOR SEQ ID NO : 2 : 

96 

97 (i) SEQUENCE CHARACTERISTICS: 

98 (A) LENGTH: 1114 base pairs 

99 (B) TYPE: nucleic acid 




PAGE: 3 RAW SEQUENCE LISTING DATE: 03/22/2000 

PATENT APPLICATION US/09/207,161 TIME: 1 1 :50: 12 

INPUT SET; S35114.mw 

100 (C) STRANDEDNESS: single 

101 (D) TOPOLOGY: linear 
102 

103 (vii) IMMEDIATE SOURCE: 

104 (A) LIBRARY: NEUTGMT0 1 

105 (B) CLONE: 632664 
106 

107 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 
108 

109 GCCGCCTCTG CCGCCGCGGA CTTCCCGAAC CTCTTCAGCC GCCCGGAGCC GCTCCCGGAG 60 

110 CCCGGCCGTA GAGGCTGCAA TCGCAGCCGG TGAGCCCGCA GCCCGCGCCC CGAGCCCGCC 120 

111 GCCGCCCTTC GAGGGCGCCC CAGGCCGCGC CATGGTGAAG GTGACGTTCA ACTCCGCTCT 180 

112 GGCCCAGAAG GAGGCCAAGA AGGACGAGCC CAAGAGCGGC GAGGAGGCGC TCATCATCCC 240 

113 CCCCGACGCC GTCGCGGTGG ACTGCAAGGA CCCAGATGAT GTGGTACCAG TTGGCCAAAG 300 

114 AAGAGCCTGG TGTTGGTGCA TGTGCTTTGG ACTAGCATTT ATGCTTGCAG GTGTTATTCT 360 

115 AGGAGGAGCA TACTTGTACA AATATTTTGC ACTTCAACCA GATGACGTGT ACTACTGTGG 420 

116 AATAAAGTAC ATCAAAGATG ATGTCATCTT AAATGAGCCC TCTGCAGATG CCCCAGCTGC 480 

117 TCTCTACCAG ACAATTGAAG AAAATATTAA AATCTTTGAA GAAGAAGAAG TTGAATTTAT 540 

118 CAGTGTGCCT GTCCCAGAGT TTGCAGATAG TGATCCTGCC AACATTGTTC ATGACTTTAA 600 

119 CAAGAAACTT ACAGCCTATT TAGATCTTAA CCTGGATAAG TGCTATGTGA TCCCTCTGAA 660 

120 CACTTCCATT GTTATGCCAC CCAGAAACCT ACTGGAGTTA CTTATTAACA TCAAGGCTGG 720 

121 AACCTATTTG CCTCAGTCCT ATCTGATTCA TGAGCACATG GTTATTACTG ATCGCATTGA 780 

122 AAACATTGAT CACCTGGGTT TCTTTATTTA TCGACTGTGT CATGACAAGG AAACTTACAA 840 

123 ACTGCAACGC AGAGAAACTA TTAAAGGTAT TCAGAAACGT GAAGCCAGCA ATTGTTTCGC 900 

124 AATTCGGCAT TTTGAAAACA AATTTGCCGT GGAAACTTTA ATTTGTTCTT GAACAGTCAA 960 

125 GAAAAACATT ATTGAGGAAA ATTAATATCA CAGCATAACC CCACCCTTTA CATTTTGTGC 1020 

126 AGTGATTATT TTTTAAAGTC TTCTTTCATG TAAGTAGCAA ACAGGGCTTT ACTATCTTTT 1080 

127 CATCTCATTA ATTCAATTAA AACCATTACC TTAA 1114 
128 

129 (2) INFORMATION FOR SEQ ID NO: 3: 
130 

131 (i) SEQUENCE CHARACTERISTICS: 

132 (A) LENGTH: 263 amino acids 

133 (B) TYPE: amino acid 

134 (C) STRANDEDNESS: single 

135 (D) TOPOLOGY: linear 
136 

137 (vii) IMMEDIATE SOURCE: 

138 (A) LIBRARY: GenBank 

139 (B) CLONE: 624778 
140 

141 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:3: 
142 

143 Met Val Lys He Ala Phe Asn Thr Pro Thr Ala Val Gin Lys Glu Glu 

144 15 10 15 

145 Ala Arg Gin Asp He Glu Ala Leu Val Ser Arg Thr Val Arg Ala Gin 

146 20 25 30 

147 He Leu Thr Gly Lys Glu Leu Arg Val Val Pro Gin Glu Lys Asp Gly 

148 35 40 45 

149 Ser Ser Gly Arg Cys Met Leu Thr Leu Leu Gly Leu Ser Phe He Leu 

150 50 55 60 

151 Ala Gly Leu He Val Gly Gly Ala Cys He Tyr Lys Tyr Phe Met Pro 

152 65 70 75 80 



PAGE: 4 



RAW SEQUENCE LISTING 

PATENT APPLICATION US/09/207, 161 



DATE: 03/22/2000 
TIME: 11:50:13 



176 
177 
178 
179 
180 
181 
182 
183 
184 
185 
186 
187 
188 
189 
190 
191 
192 
193 
194 
195 
196 
197 
198 
199 
200 
201 
202 
203 
204 
205 



INPUT SET: S35U4.raw 
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(2) INFORMATION FOR SEQ ID NO: 4: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1635 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(vii) IMMEDIATE SOURCE: 

(A) LIBRARY: GenBank 

(B) CLONE: 624777 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4: 



GGGAGACCTG 
CAAGCAAGCG 
CCTTACCGCG 
GTGCAAAAGG 
CAAATCCTGA 
AGATGCATGC 
GCCTGCATTT 
TTTGATTCTG 
ACTGAGGAGG 
AGTTTCTCTG 
TACCTGGACT 
ACTCCAAAGA 
ACTTATGTGG 
GGTATTTTTA 



AGCTCGCTGC 
CGATCCTCTC 
CAGCCCGAAG 
AGGAGGCGCG 
CTGGCAAGGA 
TTACTCTCCT 
ACAAGTACTT 
AGGATCCTGT 
CTGATATCCG 
ATAGCGATCC 
TGCTTTTGGG 
ATCTGGTGGA 
TTCGTGAAGA 
TTTACCAACT 



TGCCTGTGGA 
CTCTTTCAAC 
ATTCACCATG 
GCAAGATATA 
GCTCAGAGTT 
AGGCCTCTCA 
CATGCCCAAG 
CAATTCCATT 
TGAGGATGAC 
GGCGGCAATT 
AAACTGTTAT 
ACTTTTTGGA 
CCTGGTTGCT 
TTGCAACAAC 



AGACTGGGAG 
CTGCAGCCCA 
GTGAAGATCG 
GAGGCGCTCG 
GTCCCGCAGG 
TTCATCTTGG 
AGCACCATTT 
CCTGGAGGAG 
AACATTGCCA 
ATTCACGACT 
CTGATGCCCC 
AAACTGGCAA 
GTGGAAGAAA 
CGAAAATCCT 



AGGAGACACT 
AGATACTGAT 
CCTTCAACAC 
TCAGTCGCAC 
AGAAAGATGG 
CAGGACTGAT 
ACCATGGTGA 
AGCCATACTT 
TCATTGATGT 
TTGAGAAGGG 
TCAATACTTC 
GTGGCAAGTA 
TTCGTGATGT 
TCCGCCTTAG 



AAGTGCTGCT 
TCGAGCCGCG 
CCCTACGGCG 
TGTCCGAGCT 
CTCATCTGGG 
TGTTGGTGGA 
GATGTGCTTC 
TCTGCCTGTG 
GCCTGTGCCC 
AATGACTGCT 
CATTGTTATG 
TTTGCCTCAT 
TAGTAACCTT 
ACGCAGAGAC 



60 
120 
180 
240 
300 
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540 
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720 
780 
840 



PAGE: 5 RAW SEQUENCE LISTING DATE: 03/22/2000 

PATENT APPLICATION US/09/207,161 TIME: 1 1 :50: 13 

INPUT SET: S3S114.raw 

206 CTTCTGCTGG GTTTCAACAA GCGTGCCATT GACAAATGCT GGAAGATTAG ACACTTCCCC 900 

207 AATGAATTTA TCGTTGAAAC CAAGATCTGT CAGGAGTGAA ATGTGACAGA TAAAGAGTAT 960 

208 CCTTGATAAT AAGAAGTCAG GAACTTACCG TCTGACTTGG AAAATTGAAA TTGATGGGAT 1020 

209 ACTCATGCTA TTTACTCATA CATTTACTCT ATTGCTTATA CTGGAAAAGG AAAGGGAAAG 1080 

210 GGGGGAGAAA ACTACTAACC ACTGCAAGCG ATTGTCCAAT TCTACTTTAA TTGACATTGC 1140 

211 TTGCTGTTTT CAACAAGTCA AATGATTATC TTTTCTCTTG AATTTATAGG GTTTAGATTT 1200 

212 CTGAAAGCAG CATGAATGTG TCATCTTACC ATCCTGACAA TAAAGCCCAT CCTCTGGTTT 1260 

213 TATTTAAAGC AAGCTCTTTC CAACATCACT TGGCTAGAGC ATGCTTTAAA TTTAAAATAT 1320 

214 TTGAAATTTG TTTTTGACAT TTTTTTGTGT GAAACATGTC AAATCTCTTA CCATTCTTTG 1380 

215 GTTTTCTTCT TTATTATGTT CAACTCTCCT GATTTCAGAA GTTACATTTT TGCATTTCTA 1440 

216 TCAGGTGCTG TGTAACGAAT CTGACTGATA TGTGAACAAT CTTCATGAGG AAGCAATTTT 1500 

217 TTACTCATGT AATGATTCTT TCTCACTGAT ATCTGTATTG TGAAATCCAC AGAACTGTAC 1560 

218 AGGTGCTGAA TGCTGTAAGG AGTTCTGGTT GTATGAATTC TACAACCCTA TAATAAAGTT 1620 

219 TACCGTATTC AATCA 1635 
220 

221 



SEQUENCE VERIFICATION REPORT 

PATENT APPLICATION US/09/207,161 



DATE: 03/22/2000 
TIME: 11:50:13 



INPUT SET: S3SU4.raw 

Original Text 



